Lightweight asynchronous scheduling in heterogeneous reconfigurable systems


Abstract

The trend for heterogeneous embedded systems is the integration of accelerators and general-purpose CPU cores on the same die. In these integrated architectures, like the Zynq UltraScale+ board (CPU+FPGA) that we target in this work, hardware support for shared memory and low-overhead synchronization between the CPUs and the accelerator makes the case for exploring strategies that exploit a tight collaboration between the CPUs and the accelerator. In this paper we propose a novel lightweight scheduling strategy, FastFit, targeted to FPGA accelerators, and a new scheduler based on it, named MultiFastFit, which asynchronously tackles heterogeneous systems comprised of a variety of IPs. Our strategy significantly reduces the overhead of automatically computing near-optimal chunksizes when compared to a previous state-of-the-art auto-tuned approach, which makes our approach more suitable for fine-grained applications. Additionally, MultiFastFit has been designed to enable efficient co-execution of work among the devices in such a way that all of them are kept busy while minimizing load unbalance. Our approaches have been evaluated using four benchmarks carefully tuned for the low-power platform. Our experiments demonstrate that FastFit always finds a near-optimal chunksize for any device configuration at a reasonable cost, even for irregular applications, and that CPU+FPGA co-executions are usually faster and more energy-efficient than CPU-only and FPGA-only executions. We also compare our scheduler with other strategies, finding that it outperforms the auto-tuned approach by up to 2x and achieves results similar to manually-tuned schedulers without requiring an offline search for the ideal CPU-FPGA partition or chunk granularity.
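The co-execution idea described in the abstract, where each device repeatedly takes a chunk of the remaining work as soon as it becomes free, can be illustrated with a small simulation. This is only a sketch of generic dynamic chunk scheduling, not the FastFit or MultiFastFit algorithm: the function name `simulate_dynamic_chunking`, the fixed per-device chunk sizes, and the constant throughput model are all assumptions made for illustration.

```python
import heapq


def simulate_dynamic_chunking(total_iters, devices):
    """Simulate devices pulling chunks from a shared iteration space.

    devices maps a device name to (chunk_size, iterations_per_second).
    Both values are hypothetical inputs; a real scheduler would have to
    measure throughput and tune the chunk size at run time.
    Returns (iterations done per device, finish time per device).
    """
    # Min-heap of (time at which the device becomes free, device name).
    free_at = [(0.0, name) for name in sorted(devices)]
    heapq.heapify(free_at)

    done = {name: 0 for name in devices}
    finish = {name: 0.0 for name in devices}
    next_iter = 0

    while next_iter < total_iters:
        # The device that becomes idle first takes the next chunk.
        now, name = heapq.heappop(free_at)
        chunk_size, rate = devices[name]
        size = min(chunk_size, total_iters - next_iter)
        next_iter += size
        done[name] += size
        finish[name] = now + size / rate
        heapq.heappush(free_at, (finish[name], name))

    return done, finish


if __name__ == "__main__":
    # Assumed numbers: the FPGA is 4x faster and gets a 4x larger chunk.
    done, finish = simulate_dynamic_chunking(
        1000, {"cpu": (10, 100.0), "fpga": (40, 400.0)}
    )
    imbalance = max(finish.values()) - min(finish.values())
    print(done, finish, imbalance)
```

In this toy model, chunk sizes proportional to device throughput let both devices finish at about the same time. A chunk that is too large for the slower device would leave the faster one idle at the end, which is the load unbalance the abstract refers to.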


Similar resources

Using Lightweight Theorem Proving in an Asynchronous Systems Context

As part of the development of a new real-time operating system, an asynchronous communication mechanism, for use between applications, has been implemented in a programming language with an advanced static type system. This mechanism is designed to provide desired properties of asynchronicity, coherency and freshness. We used the features of the type system, including linear and dependent types...

Lightweight Task Analysis for Cache-Aware Scheduling on Heterogeneous Clusters

We present a novel characterization of how a program stresses cache. This characterization permits fast performance prediction in order to simulate and assist task scheduling on heterogeneous clusters. It is based on the estimation of stack distance probability distributions. The analysis requires the observation of a very small subset of memory accesses, and yields a reasonable to very accurat...

Reconfigurable Discrete Wavelet Transform Processor for Heterogeneous Reconfigurable Multimedia Systems

In this paper, a novel reconfigurable discrete wavelet transform processor architecture is proposed to meet the diverse computing requirements of future-generation multimedia SoCs. The proposed architecture mainly consists of a reconfigurable processing element array and a reconfigurable address generator, featuring dynamic reconfiguration capability where the wavelet filters and wavelet decompos...

Holistic Approach to Design Heterogeneous Reconfigurable Systems

Existing hardware-software (HW-SW) co-design methodologies mainly focus on HW-SW co-simulation to build a virtual prototyping environment that enables software design and system verification without the need to build a hardware prototype. However, they fail to reduce the gap between design productivity and system complexity. It is required to target additional design problems such as design space ...


Journal

Journal title: Journal of Systems Architecture

Year: 2022

ISSN: 1383-7621, 1873-6165

DOI: https://doi.org/10.1016/j.sysarc.2022.102398